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SPATIALLY EXPLICIT NON-MENDELIAN DIPLOID MODEL 

By N. Lanchier and C. Neuhauser 

University of Minnesota 

We introduce a spatially explicit model for the competition be- 
tween type a and type b alleles. Each vertex of the c?-dimensional 
integer lattice is occupied by a diploid individual, which is in one of 
three possible states or genotypes: aa, ab or bb. We are interested in 
the long-term behavior of the gene frequencies when Mendel's law 
of segregation does not hold. This results in a voter type model de- 
pending on four parameters; each of these parameters measures the 
strength of competition between genes during meiosis. We prove that 
with or without a spatial structure, type a and type b alleles coexist 
at equilibrium when homozygotes are poor competitors. The inclu- 
sion of a spatial structure, however, reduces the parameter region 
where coexistence occurs. 

1. Introduction. Diploid organisms carry two sets of chromosomes, which 
are segregated during meiosis to produce haploid gametes. Mendel's law 
of segregation states that pairs of homologous chromosomes segregate ran- 
domly during meiosis and are distributed to gametes. This implies that a 
gamete of an individual who is heterozygous at a locus will receive one or 
the other allele with equal probability. This implies, for instance, that 25% 
of offspring of two individuals that are both heterozygous for the same pair 
of alleles will be homozygous for one allele, 25% will be homozygous for the 
other allele and 50% will be heterozygous. 

This law does not always hold. Some organisms have chromosomal el- 
ements that show a higher than expected rate of transmission. A widely- 
studied example of meiotic drive can be found in the house mouse (Mus 
musculus). The house mouse may have a variant form of chromosome 17, 
called i-haplotype, which is transmitted to more than 90% of its offspring. 
Such systems that show a higher than 50% representation in their offspring 
are called meiotic drive systems. 
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Simple mathematical models readily show that meiotic drive by itself 
results in fixation. Yet natural populations show a low frequency of the t- 
haplotype [10-15%, see Durand et al. (1997) and references therein]. Two 
factors are contributing to the lower than expected frequency: male offspring 
who are homozygous for the i-haplotype are sterile (which by itself precludes 
fixation) and most t-haplotypes carry additional recessive lethal mutants. As 
Bruck (1957) showed, however, these two factors do not sufficiently explain 
the observed low frequencies. Meiotic drive has been observed in a number of 
other systems [see articles in Lyttle and Perkins (1991)], including the fruit 
fly Drosophila, mosquitoes, the fungus Neurospora, humans and maize. In 
order to detect meiotic drive in natural populations, a polymorphism must 
be present. 

Lewontin and Dunn (1960) used computer simulations to suggest that ran- 
dom genetic drift due to small population size lowers Bruck's predicted fre- 
quency of the t-haplotype further. Other simulations with varying assump- 
tions were conducted over the years [Levin, Petras and Rasmussen (1969); 
Petras and Topping (1983); Nunney and Baker (1993)], which are reviewed 
in Durand et al. (1997). Durand et al. (1997) concluded that "[N]one of the 
work surveyed here has fully explained the existence of the low frequency 
t-polymorphism seen in nature." They then posit a number of hypotheses 
to explain the low observed frequency, including low levels of migration in a 
spatially structured population, selection against the t-haplotype and lower 
transmission ratio distortion in wild versus laboratory populations. To test 
these hypotheses, they built a spatially structured simulation model. They 
found that "migration rate is the single most important factor in determin- 
ing the equilibrium frequency of t-haplo types." However, the model only 
yielded observed low frequencies for a very narrow range of parameters, and 
therefore essentially failed to provide an explanation for the low frequency of 
t-haplotypes in the wild. Their extensive simulations of this spatially struc- 
tured model led this group to suggest that "the basic concept of a stable, 
low frequency t-polymorphism may be wrong." Instead, there might be two 
stable states, one in which the i-haplotype is absent and another where it is 
at high frequency (as in unstructured populations). Environmental change 
may be responsible for populations switching between the two states. They 
concluded with "[I]f these [transitions] occur frequently, many populations 
moving towards a new equilibrium (either i-haplotype extinction or pan- 
mictic frequencies) will be observed. The allele frequency measured in such 
populations would be low but transient." 

Spatial simulations are difficult to interpret and we would like to caution 
against drawing conclusions that are not supported by rigorous mathemati- 
cal analysis. Rigorous results are typically restricted to specific models, but 
generalities have emerged that allow us to speculate with some confidence 
on the behavior of models. While simulations might indicate the absence of a 
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stable polymorphism for realistic parameter values and instead a presence of 
multiple stable states, we propose that a spatially explicit model on meiotic 
drive will either have a locally stable equilibrium, that is, a nontrivial sta- 
tionary distribution leading to a stable polymorphism, or fixation will occur, 
where the fixation states correspond to the multiple stable states of the non- 
spatial model. However, we suggest that the model will not alternate among 
the fixation states (except in degenerate cases). This assertion is based on 
the observation that if a nonspatial model exhibits multiple, locally stable 
equilibria, it can often be shown that the corresponding spatially explicit, 
stochastic model will eventually converge to only one of these states and that 
all other fixation states are metastable [see, for instance, Neuhauser (1994)]. 
In addition, one- and two-dimensional spatial stochastic models often exhibit 
clustering. These clusters are not fixed in space but move around, and thus 
may give the impression of local transience. The model we now introduce 
appears to exhibit these commonly observed behaviors as well. 

Our stochastic model is a continuous time Markov process in which the 
state at time t is a function — ► {aa,ab,bb}. Each site of the d- 
dimensional integer lattice is occupied by an individual with rjt(x) indicating 
the genotype (aa, ab or bb) of the individual present at site x at time t. Each 
of the two genes at site x gives birth at varying rates to a gene of its own 
type that then replaces one of the Ad genes picked at random among the 2d 
nearest neighbors. More precisely, the evolution at site x is described by the 
transition rates 



aa 



ab at rate 2(j) bb ^ t{q(z) = bb} + <p ba ^ t{r](z) = ab}, 

\\x — z\\=l — 2|| = 1 



ab^-bb at rate fob ^-{vi 2 ) = bb} + ^ l{r](z) = ab}, 

\\x—z\\=l \\x—z\\=l 

bb — ► ab at rate 2(j) aa t{ri(z) = aa} + 4> ab Y~] l{rj(z) = ab}, 

\\x—z\\=l z||=l 

ab — > aa at rate 4> aa ^2 ^-ivi 2 ) = aa \ + ^{v( z ) = a b}, 

\\x—z\\=l jz||=1 

where \\x — z\\ = sup i=1 2 d \xi — Zi\. For i,j £ {a, b}, the parameter is 
the birth rate of alleles of type i present in sites occupied by individuals 
of genotype ij. To understand the first two rates, note that an individual 
of genotype bb produces genes of type b at rate 2(fi bb while an individual of 
genotype ab produces genes of type b at rate 0& a . If the gene (of type b) is 
sent to an individual of genotype aa, then it will replace one of the type a 
genes which always results in a transition aa — > ab. But if the gene (of type 
b) is sent to an individual of genotype ab, then it will replace one of the genes 
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chosen at random which results in a transition ab — > bb with probability 1/2. 
The last two rates can be understood similarly by exchanging the roles of a 
and b. 

The mean-field model. Before investigating the consequences of the in- 
clusion of a spatial structure in the form of local interactions, we will look at 
the mean-field model [see Durrett and Levin (1994)]. The mean-field model 
can be obtained from the spatial model by assuming that individuals are 
located on the set of sites of a complete graph with N vertices (i.e., each 
site has all other sites as its neighbors which implies that there is no spa- 
tial structure), by dividing the parameters <pij by the neighborhood size N 
and by taking the limit as N tends to infinity. This results in a nonspatial 
deterministic model in which the densities of genotypes evolve according to 
a system of differential equations. More precisely, by letting u aa , u ab and 
u bb denote the densities of individuals of genotype aa, ab and bb, respec- 
tively, the mean-field model is described by the following coupled system of 
ordinary differential equations: 

du 

( 1 ) — IT = { 2( t>aaUaa + 4>abUab)Uab/2 ~ {%(p bb U bb + <Pba,U a b)Uaa, 

dt 

= (2(j) aa U aa + 4>abU a b)(u bh - U ab /2) 

(2) 

+ (2(p bb U bb + <j)baUab)(u aa ~ U ab /2), 
(3) = {2(p bb U bb + 4> ba U a b)u ab /2 - {2<j) aa U aa + 4>abU a b)ubb- 

Let tp = (cj) aa — 4> b a)(4>bb — 4>ab)- When tp < 0, the system has exactly two 
fixed points, the two trivial equilibria u aa = 1 and u bb = 1. By assuming 
that ip > 0, we find an additional (nontrivial) equilibrium given by 

4> 2 U aa = {(f> bb - (frab) 2 , 
<j> 2 U bb = ((paa -4>ba) 2 , 
(j) 2 U ab = 2((j) aa - (j) ba )((j) bb - (j) a b), 

where 4> = (^a — <t>ba) + (<frbb — 4>ab)- In Section 2 we will prove the following 
results: 

1. The trivial equilibrium u aa = 1 is locally stable if and only if cf> aa > <p ba . 

2. By symmetry, the equilibrium u bb = 1 is locally stable if and only if cp bb > 

4>ab- 

3. The interior fixed point (tp > 0) is locally stable if and only if cp aa < cp ba 
and <p bb < <p ab . 

See Figure 1 for a summary of these results. 
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Fig. 1. Phase diagram of the mean-field model. The vertical and horizontal axes represent 
the possible values of the rates (b aa and cj>bb, respectively, ranging from zero to infinity. In 
our picture, the rates 4>ab and cf)t,a ore constants whose fixed values are indicated at some 
arbitrary points. 



The spatially explicit model. To understand the role of spatial structure 
caused by local interactions, we now return to the spatially explicit model 
and provide comparisons with the nonspatial model. We refer the reader 
to Figures 1 and 2 that show the phase diagrams of the mean-field model 
and the interacting particle system, respectively. In Figure 1, the region 
labelled gene a indicates the set of parameters for which the density u aa 
converges to 1, the region coexistence the set of parameters for which there 
is a locally stable interior fixed point, and the region founder control the set 
of parameters for which the interior fixed point is unstable. In Figure 2, the 
region labelled gene a indicates the set of parameters for which the process 
starting with infinitely many individuals of type aa converges to the "all 
aa" configuration and the region coexistence the set of parameters for which 
there is a stationary distribution with a positive density of both genes. 

In the case when <j) aa and fob are small, the particle system has a sta- 
tionary distribution with a positive density of genes of type a and b, which 
is symptomatic of the existence of a locally stable interior fixed point for 
the mean- field model when qb aa < fo a and fob < (j) a b- This indicates that 
coexistence occurs for both models. 




On the other hand, in the case when and 4>b a are small, coexistence 
is not possible. This, however, translates differently for the spatial and non- 
spatial models. In the spatial model, we believe that, for almost all the 
parameters, there is fixation of one of the genes. Provided that the process 
starts with infinitely many genes of both types, the "winner" depends on the 
parameters of the model (see Figure 2). Simulations suggest that, along the 
transition curve, clustering occurs in any dimension (see the right-hand side 
of Figure 3 for a picture in d = 2), and that the three regimes meet at point 
4>aa = 4>ba and (ftbb = 4>ab as indicated in Figure 2. In contrast, the mean- 
field model exhibits founder control when <j) aa > cpba and 4>bb > 4>ab, that is, 
the interior fixed point is unstable and the "winner" depends on both the 
parameters and the initial densities (see the right-hand side of Figure 4). 

Numerical simulations also indicate that the set of parameters for which 
coexistence may occur is sensibly smaller for the spatial model. We will 
prove rigorously the existence of a set of parameters for which the interior 
fixed point is locally stable for the mean-field model whereas coexistence is 
not possible for the 1-dimensional stochastic model. This reduction of the 
coexistence region of the particle system when compared to the mean-field 
model has already been proved theoretically for a spatially explicit version 
of the Lotka-Volterra model [see Neuhauser and Pacala (1999)]. We now 
state our results in detail starting with the coexistence result. 
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Theorem 1 (Coexistence). Assume that (f) a b = <j>ba> 0. If 4> aa and fob 
are sufficiently small then there exists a translation invariant stationary dis- 
tribution v such that u{r]{x) = ab) > 0. The same holds when (f) a b < 2cpb a an d 
4>ba < 24>ab i n the 1-dimensional case. 

See the left-hand side of Figure 3 for a picture. In the case when cp a b = 
4>ba = 1 and 4> aa = (ftbb = 0, the process rjt reduces to an annihilating branch- 
ing process (ABP) in which = homozygote of either type and 1 = het- 
erozygote, that is, the state of each site of the lattice flips from to 1 and 
from 1 to at rate equal to the number of neighbors in state 1. The main 
ingredient to prove Theorem 1 is a rescaling argument that compares the 
ABP viewed on suitable length and time scales with a 1-dependent oriented 
percolation process on the lattice 

Q = {(z, n) € Z 2 : z + n is even and n > 0}. 

More precisely, by defining occupied sites in the lattice Q as the set of sites 
(z,n) € Q such that there is at least one particle (state 1) in the spatial cube 

Nzei + [-JV/4, N/4) d = (Nz, 0, . . . , 0) + [-N/4, N/A) d 

at time nT, we will prove the following theorem. 




Fig. 3. Spatially explicit model with nearest neighbor interactions at time 50 on the 
400 x 400 square with periodic boundary conditions. White (respectively, black) sites refer to 
homozygotes of type a (respectively, b), and grey sites to hetero zygotes. Left: <j> aa = 4>bt = 4 
and 4> a b = 4>ba = 5. Right: (f> aa = <j>bb = 5 and <fi a b = (j>b a = 1. In both pictures, the process 
starts with a Bernoulli product measure, half of the genes being of type a. The picture on the 
left shows the process at equilibrium but not the picture on the right in which clusters grow 
in time indicating that the only equilibria are the "all aa " and the "all bb " configurations. 
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Fig. 4. Solution curves of the mean-field model. The parameters are given by <f> aa = 1, 
4>ba = 3, <f>bb — 2 and (j> a b = 4 (left), and <f) aa = 4, <f>ba = 2, 4>bb = 3 and a b = 1 (right). 

Theorem 2 (Annihilating branching process). For any e > 0, £/ie pa- 
rameters ./V and T can be chosen in such a way that the set of occupied sites 
dominates the set of wet sites in a 1-dependent oriented site percolation 
process on Q in which sites are open with probability 1 — e. 

This implies coexistence of both genes (survival of the heterozygotes) 
when 4> aa = cftbb = 0. To conclude the proof of Theorem 1, we will apply a 
standard perturbation argument to extend the result to a region where (paa 
and 4>f,b are positive but small. 

The last three theorems provide conditions under which there is fixation 
of the gene of type a and so extinction of the gene of type b. Here, "fixation 
of the gene of type a" means that the process converges to the "all aa" 
configuration. The reader will note that, by symmetry of the evolution rules 
of the process, the conditions under which there is fixation of the gene of 
type b can be deduced by reversing the roles of genes of type a and b. The 
following results hold for the process starting with infinitely many genes of 
type a or infinitely many individuals of genotype aa. We can weaken this 
condition and assume instead that the process starts with finitely many 
genes of type a or finitely many individuals of genotype aa, and conclude 
that fixation of the gene of type a occurs with positive probability. 

Theorem 3 (Fixation). Assume that mm((j) aa ,(j) a b) > max(^ a , qb^) and 
that infinitely many genes of type a are present at time 0. Then there is 
fixation of the gene of type a. 
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Theorem 4 (Fixation). Assume that <j) a b is fixed, 



(4) 4>aa > \l + ^ 1 + - J <j) bb 

and that infinitely many individuals of genotype aa are present at time 0. 
Then there is fixation of the gene of type a provided 4>\, a is sufficiently small. 

By assuming that cp aa = 4> ab and <pba = <ftbb, the birth rate of a gene (of 
either type) at site x does not depend upon the type of the second gene 
present at site x. This makes our process a biased voter model and allows 
us to define naturally a dual process. In all the other cases, duality is not 
tractable. The proofs of Theorems 3 and 4 are made difficult by this lack of 
a dual process. To establish Theorem 3, we will rely on a coupling argument 
to compare the process with a biased voter model in which genes of type a 
are more competitive than genes of type b. The proof of Theorem 4 is more 
difficult. The idea is to prove that, under condition (4) and for the process 
starting with all sites in = [-N, N] d in state aa, the probability that 
genes of type b reach site decreases exponentially fast with the length side 
N. This will be estimated by studying the evolution of the process along all 
the paths starting outside i*jv and ending at site 0. Then we will use the 
individual of genotype aa at site as a source to produce a large cube void 
of genes of type b and will conclude by comparing the process viewed on 
suitable length and time scales with oriented percolation. A time rescaling 
in Theorem 4 suggests that cj>b a being fixed if the parameters <f> aa and 4>bb are 
sufficiently large, and if condition (4) holds, then there is fixation of the gene 
of type a. Unfortunately, this cannot be deduced from our result since (j>ba 
in the statement of Theorem 4 has to be smaller than some constant that 
(a priori) depends upon the values of the parameters cp aa and <f>bb- However, 
numerical simulations suggest the following. 

Conjecture 5 (Fixation). Assume that cj) a b and 4>b a are fixed and that 
infinitely many genes of type a are present at time 0. Also, assume that 
the ratio (fiaa/^bb > 1 is fixed. Then there is fixation of the gene of type a 
provided (p aa is sufficiently large. 

Numerical simulations also indicate that when <f) aa = (fibb > 4>ab = 4>ba (founder 
control for the mean-field model), clustering occurs in any dimension for the 
spatial model. The right-hand side of Figure 3 gives an illustration in d = 2. 

Theorems 1 and 3 show similarities with the mean-field model. Theorem 
4 implies the existence of a parameter region for which fixation occurs for 
the spatial model (the "winner" depends on the choice of the parameters) 
while the mean-field model exhibits founder control. In both models, co- 
existence cannot occur for these parameters. Similarly, Conjecture 5 and 
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Theorem 6, formula (5), below which confirms our conjecture in d = 1 ex- 
tend the parameter region for which one of the genes goes extinct eventually. 
On the contrary, Theorem 6, formula (6), shows a major difference between 
spatial and nonspatial models, namely the set of parameters for which co- 
existence occurs is smaller for the spatial model (we believe that this holds 
in any dimension, though our proof relies heavily on geometric arguments 
true in d = 1 only). This might provide a theoretical explanation for the 
low frequency of the t-haplotype in the case of the house mouse. To model 
the evolution of the house mouse, we first choose 4>ba < 4> a b where a rep- 
resents the i-haplotype to take into account the fact that the i-haplotype 
is transmitted to a majority of the offspring. Since male offspring who are 
homozygous for the t-haplotype are sterile, we also fix <f> aa = 0, though our 
model does not distinguish between male and female. Finally, we let the 
last parameter be 4>bb = (l/2)(</ , a fo + 4>ba), assuming that ab individuals are 
overall equally fit as bb individuals. This set of parameters corresponds to 
points of the phase diagram which are well inside the coexistence phase for 
the nonspatial model (see Figure 1) but to points which are much closer to 
the boundary of the coexistence phase where the frequency of type a should 
be expected to be lower for the spatial model (see Figure 2). In particular, 
the spatial component might be responsible for the low frequency of the 
t-haplotype in the example of the house mouse. 

Theorem 6 (Fixation). Assume that d = 1 and that infinitely many 
individuals of genotype aa are present at time 0. Then whenever one the 
following two conditions holds: 

(5) 4>aa > 4>bb + 4>ba + \Z<Pbb4>ba, 



there is fixation of the gene of type a. 

The rest of this paper is devoted to proofs. In Section 2, we investigate 
in detail the existence and the stability of the fixed points of the mean-field 
equations (l)-(3). In Sections 3-6, we will prove the results regarding the 
spatially explicit model. 

2. The mean-field model. This section is devoted to the study of the 
mean-field model introduced in Section 1. First of all, by setting the right- 
hand sides of (1) and (3) equal to 0, we obtain 




and (pbb > small, 



(2(j) bb U bb + 4>baUab) = Uab(2(j) aa U aa + (f) 

ab^ab ) 

2u bb (2(j) aa Uaa + 4>abUab) = U ab {2<f) bb U hb + (j) ba^abji 
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so that <iu aa Ubb = u\ h . Note that the previous equation holds when 2<p aa u aa + 
^ab^ab = as well since in this case u aa = u ab = 0. Substituting this into (1) 
and setting the right-hand side equal to 0, leads to 

= (foa ~ 4>ba)u a aUab + (V 2 )^"^ _ 2 0&&«aa«66 
= (<t>aa ~ <t>ba)Uaa.Uab ~ 2(fo b ~ 4>ab)u a a U bb = 0, 

from which it follows that 

U aa = Or ((j) aa - 4>ba)u a b = 2(fo b ~ 4>ab)Ubb 

at equilibrium. Similarly, using (3), we find 

U bb = Or (fo b - 4>ab)u a b = 2(<t>aa ~ foa)u a a 

at equilibrium. The condition u aa = gives rise to the trivial equilibrium 
u b b = 1, and the condition u bb = to the trivial equilibrium u aa = 1. If 
u aa u bb / 0, then 

2(foa ~ foa) Uaa = (foa ~ foa)(fob ~ fob)u a b = 2(fo b ~ fob) 2 U bb - 

Setting cj) = (fo a - fo a ) + (fo b - <p ab ), we get 

4> 2 Uaa = {<t>aa ~ foa) 2 Uaa + 2(fo a ~ foa)(fob ~ <fiab)u a a + (fob ~ fob) 2 Uaa 
= (fob - 4>ab) 2 Ubb + (fob - 4>ab) 2 U a b + (fob ~ fob) 2 Uaa 
= {<t>bb ~ fob) 2 - 

By using the symmetry of the model, we deduce that at equilibrium 

4> 2 Uaa = (fob ~ fobf, 
4> 2 Ubb = (foa ~ foa) 2 , 
4> 2 U ab = 2(0aa - fo a )(fob ~ fob)- 

Therefore, the condition for the existence of a nontrivial equilibrium is given 
by 

1p ■= (foa - foa) X (fob ~ fob) > 0. 

The stability of the equilibria can be analyzed using standard linearization 
techniques. Namely, denoting by J eq the Jacobian matrix at eq, the equilib- 
rium is locally stable if and only if all the eigenvalues of J eq associated with 
eigenvectors v = (v±,V2, v$) such that v 1 + V2 + V3 = are negative. We first 
look at the stability of u aa = 1. The Jacobian matrix is given by 

/0 foa~ foa -2 ' fob \ 
Ja= -fo a + fo a 2fo a + 2fo b \ . 
\0 -2 foa J 
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The eigenvalues of the matrix J a are 0, —(j)aa + <Pba and —2<j) aa . The eigenspaces 
associated with the eigenvalues and — (f) aa + are respectively, given by 

Span((l,0,0)) and Span((l, -1,0)) 

so that the equilibrium u aa = 1 is locally stable if and only if <f) aa > <f>b a - By 
symmetry, the equilibrium Ubb = 1 is locally stable if and only if cftbb > 4>ab ■ 
Now, by letting 

Ipaa = (4>aa ~ 4>ba) 2 , 
^bb = (4>bb -<t>ab) 2 , 
?Pab = (4>aa ~ 06o)(066 ~ <t>ab), 

we find that the Jacobian matrix at the interior fixed point is given by 

I / -2lp aa (j)ab ^abi^bb + <j>ab) -2lpbb4>bb \ 

Jab = —n I 2 Vw(0aa + <f>ab) —tpab{4>aa + 0bb + 4>ab + <t>ba) 24>bb(<f>bb + <Pba) J , 
V -2i'aa(f>aa Tpabi&aa + 4>ba) -2lpbb4>ba ) 

where <p = (4> aa — (ftba) + (4>bb — <t>ab) as above. First of all, it is easy to check 
that is an eigenvalue, and that it is associated to the eigenvector 

{{(j)bb ~ 4>ab) 2 ,2(<t>aa ~ ^ba)(^66 ~ 0a&)> (^aa ~ 4>ba) 2 ) 

whose coordinates add up to only if ijj < 0. In particular, since the condition 
ip > is required for the existence of an interior fixed point, the eigenvalue 
is irrelevant in the analysis of the stability. Now, we introduce the curve 

T = {(u aa ,u a b,Ubb):u aa =p 2 ,u a b = 2p(l-p), 

u bb — (1 — p) 2 for some p £ [0, 1]}. 

This curve is clearly visible in Figure 4. A population of diploid individuals 
in which the genotype frequencies lie on T is said to be in Hardy- Weinberg 
equilibrium. When Mendel's law of segregation holds, points in this curve 
correspond to the limiting genotype frequencies of a nonspatial population 
(mean-field model), with p denoting the initial frequency of genes of type a. 
Note however that the inclusion of local interactions translates into spatial 
correlations that lead to an increase of the density of homozygous indi- 
viduals, and a decrease of the density of heterozygous individuals. For the 
general mean-field model (when Mendel's law does not hold), points in T 
are no longer equilibrium points, but the curve is globally invariant under 
the time evolution. Observing in addition that T contains the interior fixed 
point (take p = (4>bb — 4>ab)4 ) ~ 1 )^ we fi n d that 

{(<t>bb ~ 4>ab), (4>aa ~ 4>ba) ~ (4>bb ~ <Pab), -{4>aa ~ 4>ba)) 

is an eigenvector, and that it is associated to the eigenvalue 

01 := (4>aa ~ 4>ba)(4>bb ~ <t>ab)((<t>aa ~ 4>ba) + (<Pbb ~ 4>ab))- 

This shows that, in the direction of the curve T, the interior fixed point is: 
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1. locally stable if (j) aa < (j) ba and <j) bb < 4>ab, 

2. unstable if 4> aa > <p ha and 4> hb > (p ab . 

This proves the instability when cf) aa > cp ba and 4> bb > 4> ab . See Figure 4 for a 
picture of the solution curves. Finally, using that and <j>\ are eigenvalues, 
it is straightforward to conclude that (as the third root of the characteristic 
polynomial) the third eigenvalue is given by 

<t>2 ■= -2((0oo - 4>ba)((t>bb ~ 4>ab){4>aa + 4>bb) 

+ (4>bb ~ 4>ab) 2 4>ba + {4>aa ~ 4>ba) 2 '<f>ab) 

which is always negative whenever ip = (cf) aa — 4> ba )(4> bb — 4>ab) > 0, the con- 
dition required for the existence of an interior fixed point. This proves the 
stability when 4> aa < 6a and 4> bb < 4> a b- 



3. Proof of Theorem 1. The aim of this section is to prove that both 
alleles may coexist provided 4> aa and 4> bb are sufficiently small. We will start 
by assuming that 4> ab = <p ba > and will explain at the end of this section 
how to deduce the result in the 1-dimensional case when 4> ab < 2cp ba and 

4>ba < ^4>ab- 

First, we observe that when 4> ab = 4> ba > and cp aa = 4> bb = 0, the local dy- 
namics of heterozygous sites can be defined regardless of the type of nearby 
homozygous sites. More precisely, by setting = homozygote of either type 
and 1 = heterozygote, or, equivalently, by letting fj t denote the stochastic 
process defined for any x £ Z rf and any time t > by 



(7) Vt(x) 



1, if r] t (x)=ab, 
0, otherwise, 



we obtain a new Markov process with state space {0, 1} Z whose state at 
each lattice site flips from to 1 and from 1 to at rate <j) ab times the 
number of neighbors in state 1. This process is known as the annihilating 
branching process (ABP) and has been investigated by Sudbury (1990) and 
Bramson, Ding and Durrett (1991). Their results show that starting with 
any measure, which puts no mass on the empty configuration, the ABP 
converges in distribution to the Bernoulli product measure with density 1/2. 
This implies that alleles of type a and b coexist when <p aa = 4> bb = 0. To prove 
Theorem 1, we will employ a rescaling argument [see Bramson and Durrett 
(1988)] where we tile the space-time region % d x Z + into large boxes, define 
a "good event" in each box and show that if a good event occurs in one 
such box, it will with probability close to 1 occur in neighboring boxes at 
a later time. Comparison with an oriented percolation process on Z x Z + 
then completes the proof. The "good event" is the presence of heterozygous 
individuals in a given box if one thinks of the process r/t, or the presence of 
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l's in a given box if one thinks of the process fjt- To show that heterozygosity 
can spread, we define a repositioning algorithm that shows that heterozygous 
individuals spread out linearly in the direction of any of the coordinate axes 
(Lemma 3.1). The reason for invoking the rescaling argument is that once 
the process is embedded in a block structure, it is straightforward to apply a 
perturbation argument and deduce that coexistence occurs as well when <f> aa 
and (ftbb are sufficiently small. The main ingredient to compare the ABP with 
a percolation process is that particles can be moved along a given straight 
line A while staying, with probability close to 1, within a reasonably short 
distance of A (see Lemma 3.1 below). 

For i 6 {1,2, ... ,d}, we let ej denote the ith unit vector, 7Tj denote the 
orthogonal projection on the ith axis, and Hi denote the hyperplane orthog- 
onal to e-i, that is, 

TTi(x) = (x,ei) = Xi and Hi = Span(ei, e 2 , . . . , ej_i, e i+1 , . . . , e d ), 

where ( ■, • ) is the usual inner product on W 1 . We also set 

k 

H itk = [-k°- 7 ,k°- 7 ] d nHi and E i>k = \J (H^+ja), 

j=-k 

where A; is a large integer whose value will vary (increase) from lemma to 
lemma. For to € {— 1, +1} and j = 1, 2, . . . , k, we introduce the hitting times 

T uj,i,j,k = inf{t > : fjt(x) = 1 for some x € H ik + ujjei} 

and T U) i ) k = sapj—i2 k Tu,i,j,k- Roughly speaking, the next lemma tells us 
that, with probability close to 1 when k is large, a particle at site can 
be moved of a distance k along the ith axis while staying within a distance 
k ' 7 of this axis. This can be done in less than Bk units of time for some 
constant B that depends on the parameters of the process but not on k. 

Lemma 3.1. Letfj (0) = 1 audi G {1,2, . . . ,d}. There exist B < oo, C\ < 
oo and ct\ > independent of k such that 

P(Tuj,i,k > Bk) < C\ exp(— aiA; 0,1 ) and 

P(fjt(x) = for all x G Ei^ and some t < T U) i t k) < C\ exp(— ai/c ' 1 ) 

for k sufficiently large. 

Proof. Note that, by symmetry, it suffices to prove Lemma 3.1 for 
to = +1. The idea is to construct a jump process X t that keeps track of 
a particle moving towards the hypercube Hi ^ + kei while staying within 
a distance k° of the ith axis. The process Xt starts at Xq = and the 
evolution rules are formulated according to whether the process moves in 
the direction of the vector e% or in the hyperplane Xt + Hi as follows: 
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1. Whenever fj t (X t + = 1, the process X t jumps instantaneously to X t + 
e,. In particular, site Xt + is empty, that is, in state 0, at any time. If 
the particle at Xt is killed by a particle located at Xt — e, (due to the birth 
of a gene originated from site X t — ei), then X t jumps instantaneously to 



2. If the particle at site Xt gives birth to a particle which is sent to x € 
Xt + Hi, then Xt jumps to site x. If the particle at site Xt is killed by 
a particle located at x € Xt + H , (due to the birth of a gene originated 
from site x G Xt + -ffj), then Xt jumps to cc. 

Rules 1 and 2 are illustrated in Figure 5 (note that the definition of Xt 
implies that at time t site Xt is always occupied). The picture shows a 
configuration of the process fj t together with the position of X t . Open circles 
refer to empty sites (state 0) and full circles to occupied sites (state 1). If 
a particle gives birth through a j-arrow, then the process Xt jumps to site 
xj, j = 1,2,3,4. Births through 1- and 2-arrows illustrate rule 1 (jumps in 
the direction of e,), while births through 3- and 4-arrows illustrate rule 2 
(jumps in the direction of Hi). Lemma 3.1 is a straightforward consequence 
of Lemmas 3.2 and 3.3 in which we investigate the projections of the process 
Xt on the ith axis and the hyperplane Hi, respectively. 

In the next lemma, we prove that rule 1 above implies that, with proba- 
bility close to 1 for k large, the projection m(Xt) reaches k in less than Bk 
units of time and before reaching —k. 



X t - e.j. 




Fig. 5. Evolution rules of the process Xt. 
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Lemma 3.2. Let a^.i = fnf{t > : TTi(X t ) > k}. Then there exists B < oo 
such that, for k sufficiently large, 

P{&w,i > Bk) < C2exp(— a2&) and 

P(iTi(Xt) < —k for some t < cr^^) < C2 exp( — 02k) 

for suitable constants C2 < 00 and ct2 > 0. 

Proof. We observe that TTi(Xt) increases by one whenever a particle 
is sent to site X t + (type 1 arrow in Figure 5), which occurs at rate at 
least (p a b = <frba since X t is occupied. Rule 1 above also implies that the 
process iri(X t ) can decrease by one only if X t — is occupied, the decrease 
resulting from the birth at Xt of a gene that originated from site Xt — ei 
(type 2 arrow in Figure 5). This occurs (when X t — is occupied) at rate 
4>ab/2 + ^6a/2 = 4>ab- I n other words, if we introduce the gap process 

G t = inf{j > : fj t (X t - (j + l)e t ) = 1}, 

then TTi(Xt) jumps to the interval [ni(Xt) + l,oo) at rate at least 4> a b re- 
gardless of the value of the process Gt and jumps to iTi(Xt) — 1 at rate (p a b 
only when Gt = 0. In addition, a straightforward calculation shows that Gt 
makes transitions as follows: 

— ► I\ at rate at least 

l\ — ► at rate at most 

where 1\ = {1,2, . . .} and where "0 — > ii" (respectively, "/1 — ► 0") stands for 
some transition from to m (respectively, from m to 0) for some mE.Ii. 
When site Xt — 2ei is empty, the first rate is the rate at which particles at 
X t — C{ and X t kill each other. When X t — 2ei is occupied, it is the rate at 
which particles at Xt — 2ej and Xt kill the particle at site Xt — ei. The upper 
bound for the second rate is reached when all the neighbors of sites X t — 
and Xt + e, are occupied. The evolution rules imply that 

P(G t+h = 0) < P(G t = 0) x [1 - 20 ab /i] + P(G t + 0) x Ad<f> ab h + 0(h 2 ). 

By subtracting P(Gt = 0) from both sides, dividing by h and taking the 
limit as h — > 0, 

jP{G t = 0) < -24> ab x P(G t = 0) + 4# a6 x P(G t + 0) 
<4d(t> a b-(M + 2)4> ab P(Gt = 0) 
from which it follows that 

2d 



F < G °=°> 2 d+ l 



x exp(-(4:d + 2)<j) a bt). 



SPATIALLY EXPLICIT NON-MENDELIAN DIPLOID MODEL 



17 



Taking the limsup as t — > oo, we get 

limsu P P(G i = 0)<-^-. 

t—>oo la + 1 

Finally, by using that iTi(X t ) increases by one at rate at least 4> a b, and 
decreases by one at rate at most <j) a b when Gt = and at rate when Gtj^O, 
we obtain 

( 8 ) i iminf !Ii^±l> 0^ -^x limsup P(G t = 0)>-|^ T . 

In particular, by setting B = Ad(f>~^ , we can conclude that 

P(a Ui i > Bk) < P(iri(X t ) < k for all t<Bk)< P{-Ki{X Bk ) < k) 

= P(Tn(X Bk ) < (<j> ab /4d) x Bk) < C 3 exp(-a 3 k) 

for suitable C 3 < 00 and 03 > and k sufficiently large. The limit in (8) also 
implies that 

P(iTi(Xt) < —k for some t < cr Ut i) < P(ni(X t ) < —k for some t > 0) 

< C4exp(— a^k) 

for suitable C4 < 00 and 04 > and k sufficiently large. The lemma follows. 
□ 

The next lemma shows that rule 2 describing the evolution of X% implies 
that the norm of the projection of Xt on the hyperplane H{ is bounded by 
k 0,7 until time Bk. 

Lemma 3.3. Let B be given by Lemma 3.2 and j € {1, 2, . . . ,i — 1, i + 
1, . . . , d}. Then, for k sufficiently large, 

P(TTj(Xt) i [-k°- 7 ,k - 7 ] for some t<Bk)< C 5 exp(-a5/c ai ) 

for suitable C5 < 00 and 05 > 0. 

Proof. Let j 7^ i. We observe that, when site x = Xt + ej is empty, the 
process Xt jumps to x whenever the particle at Xt gives birth to a particle 
which is sent to x (type 3 arrow in Figure 5), which occurs at rate 4> a b. When 
site x is occupied, the process X t jumps to x whenever the particle at x kills 
the particle at X t (type 4 arrow in Figure 5), which also occurs at rate 4> a b- 
The same holds by replacing Xt + ej by Xt — ej. This implies that TTj(Xt) 
is a continuous-time symmetric random walk starting at TTj(Xo) = run at 
rate <f> a b- This is the main ingredient to prove the lemma. Let denote 
the number of times TTj(Xt) jumps by time Bk, and Y n the discrete-time 
version of Wj(Xt) so that TTjiX^k) =^M fe - The expectation of is given 
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by m k = 2Bk(p a b. By decomposing the event to be estimated according to 
whether M k > 2m k or M k < 2m k , we obtain 

P(TTj(X t ) i [-k°- 7 ,k - 7 ] for some t < Bk) 

< P{M k > 2m k ) + P(Y n <£ [-k - 7 , k - 7 ] for some n < 2m k ; M k < 2m k ). 

Large deviation estimates imply that the first term can be bounded as fol- 
lows: 

P(M k > 2m k ) < C 6 exp(-a 6 k) 

for suitable Cq < oo and a§ > 0. The second term can be estimated by first 
using the reflection principle and then Chebyshev's inequality. Given 8 > 0, 
we obtain 

P{Y n i [~k - 7 , k - 7 } for some n < 2m k ;M k < 2m k ) 

< 2P{Y 2mk $ [-k°- 7 ,k - 7 ]) <2exp(-0k°- 7 )Eexp(0Y 2mk ) 

<2exp(-9k - 7 ) J] Eexp(#(Y n - Y n ^)) 

n=l 

<2exp(-6»/c a7 + 2m k log 0(0)), 

where 4>{9) is the moment generating function of Y\. Since EYi = and 
VarYi < oo, we have that log (/>(#) < C-j6 2 for some Cj < 00 and for 6 small 
enough. By taking 9 = A;" ' 6 in the last expression, we can conclude that 

P(Y n (£ [-k - 7 , k - 7 } for some n < 2m k ;M k < 2m k ) 

< 2exp(-k - 1 + AC 7 ab Bk~ - 2 ) < 2exp(-k°- 1 /2) 

for all k sufficiently large. This completes the proof. □ 

To deduce Lemma 3.1, we first observe that Lemmas 3.2 and 3.3 imply 
that 

P(T u>i>k > Bk) < P{a^i > Bk) 

+ P(TTj(X t ) (£ [-k - 7 , k - 7 ] for some t < Bk and some j ^ i) 
< C 2 exp(-a 2 k) + (d- 1) x C 5 exp(-a 5 /c ai ). 
By Lemmas 3.2 and 3.3, we also have 

P(rjt(x) = for all x £ Ei yk and some t < r^^k) 

< P(-Ki(X t ) < —k for some t < <Ju,i) 

+ P(iTj(Xt) $l [— k ' 7 , k ' 7 ] for some t < (T u i and some j 7^ i) 

< P(iTi(X t ) < -k for some t < a^i) + P(<7 w ,i > Bk) 

+ P(irj(Xt) £ [— k - 7 , k - 7 ] for some t < Bk and some j ^ i) 

< 2C 2 exp(-a 2 k) + (d - 1)C 5 exp(-a 5 k 0A ) 
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which concludes the proof of Lemma 3.1. □ 

Note that rule 1 describing the evolution of Xt allows to give the process 
Xt a drift in the direction of the vector ej whereas rule 2 implies that the 
projection of Xt on the orthogonal hyperplane Hi evolves according to a 
random walk without drift. However, rule 2 can be defined in such a way 
that the tagged particle has an additional drift towards the ith axis. This 
approach would probably strengthen Lemma 3.1 as follows: a particle at site 
can be moved of a distance k along the ith axis while staying within a 
distance independent of k of the axis. This result is obviously more difficult 
to prove than our Lemma 3.1 but would probably shorten the proofs of 
Lemmas 3.4 and 3.5 below. 

The next step is to show that, for given e > 0, the particle system, when 
viewed on suitable length and time scales, dominates (in a sense to be made 
precise) the set of wet sites of an oriented percolation process on the lattice 

Q = {(z,n) € Z 2 : z + n is even and n > 0} 

in which sites are open with probability 1 — e [see Durrett (1984), for more 
details about oriented percolation]. For k = 1, 2, . . . , let J K = [—N/k, N/n) d 
where N is a large integer to be fixed later and let T > 0. A site (z,n) € Q 
is said to be occupied if there is at least one particle in the translated cube 
Nze\ + J4 at time nT, that is, 

VnT(x) = 1 for some site x E Nze\ + J4. 

The objective is to show that if site (z, n) G Q is occupied then (z — 1, n + 1) 
and [z + l,n + 1) are occupied with probability at least 1 — e. Since the 
evolution rules of the particle system are translation invariant in space and 
time, it suffices to prove the result when z = n = 0. To prepare for the 
rescaling argument, we need the following lemma, which will be applied 
repeatedly and in which we prove that, with probability close to 1 when TV" 
is large, a particle in J^ K can be moved to the smaller cube Jiq k in less than 
dBN units of time and before stepping out of J R . 

Lemma 3.4. Let k be a positive integer and consider the stopping times 
Vi n = inf {t > : fft(x) = 1 for some x G Jiqk} and 
v out = inf {t > : fj t (x) = for all x£j K }. 
If there is at least one particle in J± K at time 0, then 

P(v in < dBN;v m < v out ) > 1 - C 8 exp(-a 8 iV ai ), 



where B is given by Lemma 3.1 and for suitable C$ < 00 and as > 0. 
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Proof. The idea is to apply Lemma 3.1 d times in each direction i G 
{1,2, ... ,d} to move a particle from anywhere in to Jiq k . We refer the 
reader to Figure 6 for an illustration. To make the argument precise, we let 
X G Ji K with f]o(X) = 1 and, for j = 0, 1, . . . , d, set 

Dj = [-jN°- 7 ,jN°- 7 ] d + 7T j+1 (X)e j+1 + 7T J+2 (X)e j+2 + ■■■ + 7r d (X)e d , 
Vj = inf{i > : fj t (x) =1 for some x G Dj} 

and ujj = — sign(7Tj(X)). Note that the sequence starts at Do = X. By ob- 
serving that 

D d C Ji6re for all N sufficiently large and 
+ {x + E ijN/2K } n Hi C A for all x G A-l 
and applying Lemma 3.1 with i = 1, 2, . . . ,d, k = N/2k and u = u>i, we obtain 

P(u iB > dBN) < P(v d > dBN/2K) < ^ P(t^ - > BN/2k) 

i=i 

d 

<Y. P ( T ^k > BN/2k) < C 9 exp(-a 9 iV - 1 ), 
i=l 

where Cg = C\ x d and ag = qi/(2k) ' 1 . Since x + -E^a^k C J k for all x G 
A-l an d the evolution rules of the process are translation invariant in space 
and time, we can also deduce that 

d 

P(v in > V out ) < P(v d > V out ) <^P(Vi > V out \Vi-l < V out ) 

1=1 

d 

< SU P p {Vt(z) = for all z G x + E^ n/2k 




u> 2 = -1 Jie K 



7Tl(X) ei 



Fig. 6. Picture of the block argument. 
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< P{Vt(z) = for all z e E i>m 



i=i 



and some t < T Ui>ijk | 770(0) = 1) 
<C 9 exp(-a 9 iV - 1 ), 
where Cg < 00 and ag > are denned as above. This completes the proof. 



The next lemma provides the main estimate for the rescaling argument. 

Lemma 3.5. Let T = (3d + 1)BN and assume that fjo(X) = 1 for some 
J G J4. Then 

P(ffr(x) = for all xeJ 4 + N ei )< C w exp(-ai iV ai ) 
for suitable C\q < 00 and a 10 > 0. 

Proof. For any integer i > 1, we introduce the stopping times 
rp _ / inf{t > Tj—i : fjt{x) = 1 for some x 6 Je4 + Ne±}, for j odd, 



with the convention To = 0. In words, the stopping times Tj's tell us alter- 
nately when a particle appears in J 64 + Ne 4 and when J 17 + Ne\ becomes 
empty. Let T* be the first exit time 



Then, to establish the lemma, it suffices to prove that T\ <T and T* > T 
with probability arbitrarily close to 1 when N is large since this indicates 
that a particle appears in J134 + Ne\ by time T and then stays inside J4 + Ne\ 
until time T. 

To estimate the event that T\ < T, we start by applying successively 
Lemma 3.4 to bring a particle from J4 to J\q, then from J\§ to Jq 4 , and 
finally from to J256 in less than 3dBN units of time. More precisely, we 
have 

P(fj t (x) = for all x G J 256 and all t < MBN) < 3 x C 8 exp(-a 8 Af - 1 ). 

Once there is a particle in J256, we apply Lemma 3.1 with i = 1 and k = N 
to conclude that 



□ 




for j even 



T* = inf{t > Ti : ^(x) = for all x € J 4 + iVei}. 



(9) 



P(Ti > T) = P(Ti > (3d + l)BN) 

<3 x C 8 exp(-a 8 Ar ai ) + Ciexp(-aiiV ai ) 
<Cnexp(-a 11 Af ai ) 
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for N large enough so that J256 + [— N - 7 , N°- 7 ] d C Jq4 and suitable C\\ < 00 
and an > 0. 

We now estimate the event that T* > T, which is the event that box 
J4 + iVei contains at least one particle from T\ to T . Due to nearest neigh- 
bor interactions, there is at least one particle within distance 1 from the 
boundary of Jyj + Ne±, and so contained in J\§ + Ne±, at time I}— 1 when 
j is odd. In particular, by applying Lemma 3.4 with k = 4, we get 

(10) P(Tj > T*\Tj-i < T*) < C 8 exp(-a 8 iV ai ) for j odd. 

Now, the definition of the Tfs implies that, between time Tj-\ and time Tj 
for j even, there is at least one particle in the box Jyj + Ne\. It follows that 

(11) P(Tj > T*\Tj-i < T*) = for j even. 

Roughly speaking, (10) and (11) tell us that the number of times a particle 
appears in J64 + iVei and Ji7 + iVei becomes empty before J4 + iVei becomes 
empty can be made arbitrarily large by choosing N large. We still need to 
prove that the number of steps up to time T is smaller than a constant 
uniform in N by estimating 

K = inf{j >l:Tj>T} 

which is about twice the number of times Jyj + iVei becomes empty by 
time T. Since particles die at rate at most 2d(J) a b and the number of deaths 
required to empty J17 + iVei starting with at least one particle in Jq^ + iVei 
is larger than N/32, we obtain 

E[Tj - > N x (Md(j) ab )- 1 for j even. 

Using the fact that T is equal to a constant times N, large deviation es- 
timates imply the existence of some m = m(d, 4> a b) independent of N such 
that 

(12) P(K >m)< C12 exp(-ai 2 iV) 

for suitable C\i < 00 and a\i > 0. 

Combining estimates (9)-(12), we can conclude that 

P(ffr(x) = for all i£j 4 + Nei) 

< P(T X >T) + P(T* < T) 

K 

< P(Ti >T) + J2 p (Tj > < T*) 

i=2 

m 

< P(Ti > T) + P(K >m) + Y, P(Tj > T*\Tj-i < T*) 

3=2 

< Cnexp(-aniV - 1 ) + Ci 2 exp(-ai 2 A r ) + m x C 8 exp(-a 8 Af ai ). 
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This completes the proof of the lemma. □ 

Lemma 3.5 implies that, for any e > 0, the parameters N and T can 
be chosen in such a way that, provided that site (0,0) is occupied, (1,1) 
is occupied with probability at least 1 — e. Now, since the range of the 
interactions is finite, the events described in Lemmas 3.2-3.5 have a finite 
range of dependence. In particular, with probability at least 1 — e for N 
sufficiently large, the configuration of the process at time T in the box Ne\ + 
[-N, N] d only depends upon the graphical representation in the space-time 
region 



This implies the existence of an event Go,o that has probability at least 
1 — 2e, which is measurable with respect to the graphical representation of 
the process restricted to -Bo,o> and such that, on the event that site (0,0) is 
occupied and Go,o occurs, site (1, 1) is occupied. Finally, since the evolution 
rules of the process are translation invariant in space and time, we conclude 
that there exists a collection of events {G z<n :(z,n) £ such that, for any 
e > 0, the parameters N and T can be chosen in such a way that: 

1. The event G z , n is measurable with respect to the graphical representation 
of the process restricted to the space-time region B z<n = (Nzei,nT) + 
{[-2N,2N] d x [0,T]}. 

2. P(G Zjn ) > 1 - 4e. 

3. If (z, n) is occupied and G z ^ n occurs, then {z — 1, n + 1) and (z + 1, n + 1) 
are occupied. 

This implies that the set of occupied sites dominates the set of wet sites in 
a 1-dependent oriented site percolation process on Q in which sites are open 
with probability p = 1 — 4e [see Durrett (1995) for a rigorous proof]. This 
completes the proof of Theorem 2. 

To deduce Theorem 1, the last step is to apply a perturbation argument 
to prove that the previous property holds when cj) aa and <f>bb are positive but 
small. The probability that a gene originated from a homozygote is sent to 
the space— time box B z n can be bounded by 



x&[-2N,2N} d 

where <j> = 2{(p aa + 4>bb)- The parameters N and T being fixed so that, when 
4>aa = 4>bb = 0, the set of occupied sites dominates the set of wet sites in a 
1-dependent percolation process with parameter 1 — 4e, the rates <p aa and 
fibb can be chosen so small that the probability in (13) is smaller than e. 
With this choice, the set of occupied sites now dominates the set of wet sites 



£ ,o = (Ne u T) + {[-2iV,2Af x [0,T]}. 



(13) 



E 
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in a percolation process with parameter 1 — 5e. To conclude, we take e > 
sufficiently small so that percolation occurs, and start the particle system 
from the "all aV configuration. Then the limit v is a stationary distribution. 
Moreover, the process being coupled with a supercritical oriented percolation 
process, the density of heterozygotes under the measure v is strictly positive. 
This completes the proof of Theorem 1 when qb a b = (f)^ a . 

To conclude this section, we now prove Theorem 1 when <p a b ^ 4>t, a and 
d = l. Since the assumption <p a b = cfiba has only been used in the proof of 
Lemma 3.1, it suffices to extend Lemma 3.1 (see Lemma 3.7 below) to the 
1-dimensional case when 4> a b < 2(pba and cftba < 2<^ a &. The difficulty arising 
from the fact that 4> a b ^ 4>ba is that births of heterozygous individuals onto 
sites in state aa occur at rate a & whereas births of heterozygous individuals 
onto sites in state bb occur at rate <f>b a - Without loss of generality, we can 
assume from now on that (f) a b > (pba- To facilitate the writing of the proof, 
we will use as above the process fjt defined in (7). Note however that, due 
to the general assumption <j) a b ^ 4>ba, the process fjt is no longer a Markov 
process. It should be thought of as an auxiliary process that allows us to 
keep track of heterozygous sites. Let 

Xf = sup{x € Z : fjt(x) = 1} and Xf = inf {x 6 Z : fj t (x) = 1} 

denote the right and left edges of the process at time t. Then the following 
lemma holds. 

Lemma 3.6. Assume that </> a & < 2(pb a an d start the process with a single 
particle at site 0. Then there exists a constant C13 > depending only on 
the rates cf> a b and <f>ba such that 

liminft _1 X t + > C 13 and liminf t^Xf < -C 13 . 

t— »oo t— >oo 

Proof. By symmetry of the evolution rules, it suffices to prove the 
lemma for X^ only. The proof is similar to the proof of Lemma 3.2. We 
introduce the gap process 

G t = mf{j>0:f] t (X+ -j-l) = l}, 
Table 1 

Evolution rules of Gt ■ In our table, Ik = {k, k + 1, . . .}, o = state 0, and • = state 1. 
(0 — » h indicates a jump from state to some state in I\, etc.) The rows refer to the 
cases when Gt = 0, Gt — 1 and Gt > 2, respectively 



Configurations 


Transition 


Rate 


Drift (X+) 


OMOOO ■ ■ ■ •••OOO ■ ■ ■ 

ototoo ■ ■ ■ ••o»oo • • • 
• oo»oo ■ • ■ •oooto • • • 


0- >Ji 

1- »0 
h ^0 


> <f) a l + <f> ba 

< 3</>ab 

< 2<j>ab 


> {(j>ba - 4>ab)/2 

> <t>ba 

> <f>ba 
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and observe that Xf jumps to the left at rate (4> a b + 4'ba)/2 when G t = 0, and 
jumps to the right at rate at least 4> ba regardless of the value of Gt- Moreover, 
a straightforward calculation shows that the process Gt makes transitions 
as indicated in Table 1, where «'s refer to l's and o's to 0's, and where It 
denotes the set {k,k + 1,...}. The transition from to I\ is obtained by 
killing one of the particles at X^~ and X^~ — 1 (first configuration) or by 
killing the particle at Xf — 1 (second configuration). Both events occur at 
rate twice the rate at which a particle is killed by one of its neighbors. The 
last two transitions result from a birth at Xf — 1 or Xf + 1. Since 4> ab > (j) ba , 
upper bounds are reached when each empty site is in state bb. These rules 
imply that 

P(G t+h = 0) < P(G t = 0) x [1 - {<t> ab + <t> ba )h] + P(G t + 0) x 3(P ab h + 0(h 2 ). 
The same argument as in Lemma 3.2 leads to 



liminf t^X+ > 4> ba - x i im P (o t = 0) 

t^oo 2 t^oo 

34>ab 



> (Pba ~ {4>ab + Pba) X 



2(4(f) ab + 4> ba ) 

= 24> ba + ^4>ab4>ba ~ 3^ 6 

2{Acj) ab + (t) ba ) 

The roots of Q(X) = 2X 2 + 5(j) ab X - 3^ b are -3<p ab and 4> ab /2. This implies 
that the last term of the previous inequality is positive if and only if cj) ab < 
2cj) ba and proves the lemma. □ 

We can now use Lemma 3.6 to demonstrate that with high probability l's 
spread out linearly in time and can be found within this growing cone. The 
second part of Theorem 1 then follows as before from a rescaling argument 
and we omit the details. 

Lemma 3.7. Assume that (p ab < 2(f> ba and 770(0) = 1. For to £ {— 1,+1}, 
we let 

T u = mf{t>0:fj t {Luk) = l}. 
Then there exist constants B < 00, C14 < 00 and 014 > such that 

P(t w > Bk) < C14 exp(— auk) and 
P(fjt(x) = for all x G [— k, k] and some t < t w ) < Ci4exp(— auk). 

Proof. By symmetry, it suffices to prove the result for uj = +1. Let 
Yt~ = sup{x < k : fjt(x) = 1} and Y t ~ = ini{x > k : f]t(x) = 1}. 
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Since Y t + and Yf do not interfere by time t w , the process Y t + — Yq~ is equal 
in distribution to the right edge of the particle system starting with 
a single particle at site by time t u . In particular, using large deviation 
estimates, the fact that Yq~ > and Lemma 3.6, we have 

P(j u > Bk) < P(X+ < k for all t < Bk) 

< P{X+ k <k)< C 15 exp(-a 15 k) 

for B = 2/C13 and suitable C15 < 00 and 015 > 0. Lemma 3.6 also implies 
that 

P(fjt(x) = for all x E [— k, k] and some t < t u ) 

< P{Xf < -k for some t > 0) < Cie exp(-ai 6 fc). 
This completes the proof of Lemma 3.7. □ 

4. Proof of Theorem 3. To prove Theorem 3, it is more convenient 
to consider our particle system as a set of genes evolving on "L d x {0, 1} 
rather than a set of genotypes evolving on the lattice 7L d . More precisely, 
we introduce the Markov process £t :7L d x {0, 1} — ► {a, b} whose evolution 
at site (x,g) is given by the following transition rates, where we have set 
h = g + lmod2: 

a — > b at rate 2</>f,6 ^{£{ z i 9) = ^{ z i h) = b} 

\\x—z\\=l 

+ (/) ba J2 t{i{z,g) = a,i{z,h) = b} 

\\x—z\\=l 

+ <j) ba HZ(z,g) = b,£(z,h)=a}, 

\\x—z\\=l 

b^a at rate 2(p aa ^ ±{£{z,g) = £(z, j) = a} 
\\x—z\\=l 

+ <p ab H£{z,g) = a,£(z,h) = b} 

\\x—z\\=l 
\\x—z\\=l 

The process rjt is a genotype-based model whereas the process £j is a gene- 
based model, however both describe the same genetic system in the sense 
that they can be defined on the same probability space in such a way that, 
for any site x € Z d and any time t > 0, 



(14) rn(x)=Zt(x,0)St(xA) or Vt (x) = &(x, 0) 
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provided the property holds at time 0. In view of (14), it suffices to prove 
our results for either one of the two processes. We will prove Theorem 3 for 
the gene-based model. 

The proof of Theorem 3 relies on duality techniques supplemented with a 
standard coupling argument. Let fo = min(0 aa , fob) and fo = m&x(fo a , fob). 
The first step is to construct the gene-based model and the biased voter 
model with parameters fo and fo on the same probability space in such a 
way that & has more genes of type a than the biased voter model (denoted 
by Q t later). To do this, we will construct both processes from the same 
collections of independent Poisson processes, which is referred to as Harris' 
graphical representation [Harris (1972)]. We will conclude by showing that 
Qt converges to the "all a" configuration provided fo > fo. 

To construct the gene-based model £t we put, for all g,h€ {0,1} and 
all sites x, z S Z rf such that ||x — z\\ = 1, a specific arrow from site (x,g) 
to site (z, h) at the arrival times of independent Poisson processes with 
suitable rates. Specifically, for i,j 6 {a, b}, we put, at the arrival times of 
a Poisson process with parameter foj, a type ij arrow from site (x,g) to 
site (z,h) to indicate that if site (x,g) is occupied by a gene of type i and 
site (x,g + 1 mod 2) is occupied by a gene of type j then site (z, h) becomes 
occupied by a gene of type i. This graphical representation allows us to 
construct the gene-based model starting from any initial configuration £q- 
Figure 7 shows a realization of the graphical representation of the gene-based 
process. 

To construct the biased voter model Ct> we put, for all g,h € {0,1} and 
all sites x,z € Z d such that \\x — z\\ = 1, a type i arrow from site (x,g) to 
site (z, h) at the arrival times of independent Poisson processes with rate fo, 
i € {a, b}. This indicates that if site (x, g) is occupied by a gene of type i then 
[regardless of the gene present at site (x,g + lmod2)] site (z, h) becomes 
occupied by a gene of type i. 

To construct both processes on the same probability space, we use Poisson 
processes with rates depending on the signs of 4> aa — 4> a b and fob — fo a - Ta- 
ble 2 gives an explicit description of the graphical representation in the four 
possible cases. The type i arrows, i G {a, b}, in the construction of the gene- 
based model have the same interpretation as the ones in the construction of 
the biased voter model. In particular, a type i arrow in the construction of 
the gene-based model can be seen as the superposition of a type ia arrow 
and a type ib arrow. In view of the well-known properties of the Poisson pro- 
cess, each of these graphical representations produces the desired flip rates 
to construct the processes we are interested in. Moreover, it follows directly 
from Table 2 that 



(15) Ct(x,i) = a => £ t t(x,i)=a for all x£Z and i 6 {0, 1} 
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Fig. 7. Construction of the gene-based model from the Harris' graphical representation. 
The thick lines refer to sites occupied by a gene of type a. 



at any time t > provided this holds at time 0. Note, however, that in 
our coupling the Poisson processes with rate faj, hj £ {«>fr}> are no longer 
independent. 

Assume that <p a > fa and that infinitely many genes of type a are present 
at time 0. The last step is to prove that the biased voter model Ct'-^ x 
{0, 1} — > {a,b} with parameters 4> a and fa converges almost surely to the 
"all a" configuration. The proof relies on duality techniques. We construct 
the process by drawing a type a arrow (respectively, an unlabeled arrow) 
from site (x,g) to site (z,h) at the arrival times of a Poisson process with 
rate <f> a — fa (respectively, fa). Unlabeled arrows indicate that the gene at 
site (z,h) is replaced by the gene at site (x,g) regardless of the type of 
both genes. The dual process is constructed by going backwards in time and 
jumps at the tips of unlabeled arrows while branches at the tips of type a 
arrows as indicated in Figure 8. More precisely, if ( s denotes the dual process 
at dual time s then: 

1. if there is an unlabeled arrow from (x,g) to (z, h) at time t — s for some 
(z, h) 6 Q s - , then Q can be deduced from £ g - by removing site (z, h) and 
adding site (x,g) while 
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Table 2 

Couplings of the gene-based model and the biased voter model 



Rate Gene-based Biased voter 





rh < til 


t b and i 


Pbb Jl < 


i'ba 




0aa 




a 






a 


0a6 - 4>aa 




ab 








<t>bb 




6 






6 


(l)hn — 066 




6a 






6 




4>aa < 01 


.6 and i 


^66 > ' 


/56a 




(j) aa 




a 






a 


4>ab — 4>aa 




a6 








06a 




6 






6 


4>bb — 4*ba 




66 






6 




4>aa > 4>i 


ib and . 


^66 < ' 


Ha 




<t>ab 




a 






a 


4>aa — 4>ab 




aa 








0bb 




6 






6 


06a - 066 




6a 






6 




4>aa > 4>c 


ib and i 


Hb > 1 


Pba 




4>ab 




a 






a 


<t>aa — 4>ab 




aa 








4>ba 




6 

66 






6 
6 


066 — 06a 













2. if there is a type a arrow from (x,g) to (z, h) at time t — s for some 
(z, h) £ Cs-j then £ s can be deduced from £ s - by adding site (x,g). 

The convergence to the "all a" configuration follows from the analysis 
of the dual process. A stronger version of this result is proved in Bramson 
and Griffeath (1980, 1981) for the biased voter model on Z d . Their proof, 
however, easily extends to the process on Z d x {0, 1}. This, together with 
(14) and (15), concludes the proof of Theorem 3. 

5. Proof of Theorem 4. Similar to the proof of Theorem 1, the proof of 
Theorem 4 relies on a rescaling argument. First, we set F N = [-N, N] d for 

> and denote by Pp N the law of the process starting with all sites in Fn 
occupied by individuals of genotype aa, and all sites outside Fn occupied 
by individuals of genotype bb. The first step is to prove the following lemma. 
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Fig. 8. Picture of the dual coalescing branching random, walks. 



Lemma 5.1. Assume (4) and(f)i ia = 0. Then 

PF N (i]t(0) 7^ aa at some time t>0)< Cyj exp(— a\jN) 
for suitable constants C-yj < oo and an > 0. 

The proof of Lemma 5.1 is made difficult by the lack of a dual process. 
To estimate the probability that a gene of type b reaches site 0, we will 
construct a path 7 C Z rf starting outside the cube Fn and ending in the 
neighborhood of site along which genes of type b spread. The construction 
will be done by going backwards in time. By investigating the evolution of 
the process along the invasion path 7 by going forward in time, we will prove 
that the probability that such a path exists is smaller than a constant times 
c~ K for some c > 2d, where K denotes the length of 7. We will conclude the 
proof of Lemma 5.1 by summing over all the possible paths starting at site 
x for some x G Z d \ Fn and ending in the neighborhood of site 0. 

Lemma 5.1 implies that, with probability arbitrarily close to 1 when N 
is large, the individual at site is of type aa at any time t > 0. The second 
step is to use this individual as a source to fill the cube F^n with genes of 
type a. This occurs by some time T < 00 with probability arbitrarily close 
to 1 for N large. By applying the same arguments as in the proof of Lemma 
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5.1 to all the individuals in F2N instead of the individual at site 0, we will 
deduce the following lemma. 

Lemma 5.2. Assume (4) and </>b a = 0. For any e > 0, 
Pf n {j]t{x) 7^ aa for some x 6 i^Ar) < 2e 
/or a suitable integer N > and a (deterministic) time T < 00. 

Lemma 5.2 implies the existence of a partition of Z rf into cubes of side 
length 2N + 1 such that the set of cubes filled with a's at time 2nT, n > 0, 
dominates the set of wet sites at level 2n of a 1-dependent oriented perco- 
lation process with parameter 1 — 2e. This can be proved by applying the 
same arguments as in the proof of Theorem 1 (for more details, see our 
construction after the proof of Lemma 3.5). To prove that genes of type a 
actually outcompete genes of type b, namely that there is an in-all-directions 
expanding region which is void of genes of type b, we apply a result from 
Durrett (1992) that shows that sites that are not occupied do not perco- 
late when e is close enough to 0. Since genes of either type cannot appear 
spontaneously, once a region is void of one type, this type can only reappear 
in the region through invasion from the outside. To complete the proof of 
Theorem 4, the last step is to apply a perturbation argument similar to the 
one described in the proof of Theorem 1 to extend the result to the case 
when (ftb a is small. We now prove Lemmas 5.1 and 5.2 in detail. 

5.1. Construction of the invasion path 7. The first step in proving Lemma 
5.1 is to demonstrate the existence of a path 7 C 7L d along which genes of 
type b spread in the case when the stopping time 

T N = inf{i> 0:^(0) ^ aa} 

is finite. We first let the process starting with genes of type aa in Fn evolve 
until time T/v and then do the construction inductively by going backwards 
in time. 

We set Sq = T/v. The definition of So implies that there is a gene of type 
b that is sent to site at time Sq. This gene must have originated from a 
site To with 1 1 1~*o 1 1 = 1- 

Since (pba — 0j we deduce that site To is necessarily in 
state bb at time So. Then we let 

Si = sup{s < S : 7? s (r ) = aa}. 

Again, there is a site Ti with ||ri — Toll = 1 which is in state bb at time £1. 
Let 



S 2 = sup{s < Si :77a(ri) = aa}. 
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This procedure allows us to construct a sequence (Ti,S{) such that time Si 
is almost surely finite and site Tj is in state 66 at time Si- The construction 
stops at i = K where 

K = min{i > : rj s (Ti) ^ aa for all s £ [0, Si]}. 

Note that this implies that Tk ^ Fn and that K > N . Our construction 
implies that 

< Sk < Sfc—i < • • • < 5*o = T/v- 

Since it will be more convenient to work forward in time, we construct the 
invasion path 7 by reversing the direction of time, that is we set 

7(7) = F K _i and s, = S^-i 

for i = 0, 1, . . . ,K , and 7 = ( 7 (0), 7(1), . . . , 7 (K)). 

5.2. Evolution of the genotypes along 7. The next step is to define a 
process Ut starting at loq = such that 7(0^4) follows the invasion of genes 
of type b in the cube F^. The process Ut is stochastically smaller than 
a process whose probability to reach K decreases exponentially in K (see 
Lemma 5.3 below). This allows us to conclude that the probability that the 
path 7 exists decreases exponentially fast with its length, which is the key 
to prove Lemma 5.1. 

The process u> t starts at ujq = and has values in {0, 1, ... , K}. The tran- 
sition rates depend on whether site 7(0*4 + 1) is in state aa or not. 

1. If j(ut + 1) is in state aa (at time t), then 

{uj t + 1, when the state of 7(1^ + 1) jumps to ab, 
sup{i <u t - 1:774(7(4)) / aa}, 
when the state of 7(^4) jumps to aa. 

2. If 7(0*4 + 1) is not in state aa (at time t), then 

ujt — > sup{i < W4 — 1 : 774(7(4)) 7^ aa} when the state of 7(^4) jumps to aa. 

Note that ujf is well defined until time T/v and sup{i < u>t — 1 : 774(7(7')) 7^ 
aa} > until this stopping time condition on the event that the invasion 
path 7 exists. Our construction also implies that uit jumps from i to i + 1 at 
time Si (see the left-hand side of Figure 9) so that ut reaches state K at time 
sk = Ijv, and that site 7(1^4) is in state a& or 66 up to time sk ■ To keep track 
of the genotype, we define a process 0)4 with state space (Z/2) n (— 1,-K] by 
letting 

w f = uj t - l±{r) t ('y(u t )) = 06,774(7(^4 + 1)) = aa}. 

See the right-hand side of Figure 9. We now describe the process ujt in details. 
The transition rates of the process depend on the state of sites 7 (ujt) and 
7(0^4 + 1), and we distinguish four different cases that we call configurations 
1-4 (see Figure 10 for an illustration): 
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7 (0) 7 (1) 7(2) 7(3) 7(4) 7(0) 7(1) 7(2) 7(3) 7(4) 

Fig. 9. Illustration of u) t (on the left-hand side) and u) t (on the right-hand side) in 
dashed lines from a realization of the particle system. In both pictures, the horizontal axes 
represents both the state space of 7 (on the bottom) and the state space of the processes u>t 
and u>t (on the top). Vertical thick black lines refer to sites in state bb, thick grey lines to 
sites in state ab and dotted lines to sites in state aa. 



1. Config. 1 — site ^(uJt) is in state bb and site 7(0^ + 1) is in state aa. 
In this case, u>t can either jump to cut + 1/2 or G)% + 1 when the state of 
site j(ut + 1) jumps to ab, or jump to u t — 1/2 when the state of site 




Fig. 10. Pictures of the configurations 1~4- 
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7(u>t) jumps to ab which implies that 

r < u>t H- 1 , at rate at most Adcpbb , 
L ° t ~ > \ u>t — 1/2, at rate at least 2(f) aa . 

2. Config. 2 — site 7(0^) is in state bb and site 7(0^ + 1) is not in state aa. 
In this case, the process u>t is frozen. 

3. Config. 3 — site j(uJt) is in state ab and site j(uJt + 1) is in state aa. 

In this case, H>t can either jump to u)t + 1/2 when the state of site 7(0^) 
jumps to bb or the state of site 7(0^ + 1) jumps to afe, or jump to the left 
when the state of site 7(wt) jumps to aa which implies that 

Qt + 1/2, at rate at most 6d(f>bb, 

< Ut — 1/2, at rate at least </> aa . 

4. Config. 4 — site 7(a>t) is in state ab and site j(ut + 1) is not in state aa. 
In this case, the process Hit can only jump to the left. 

The process uit is well defined and > — 1/2 until time T\r condition on 
the event that the invasion path 7 exists. However, when ujt = and the 
configuration is of type 3, the state of 7(0) may still jump to aa at the rate 
indicated above, thus contradicting the existence of 7. This motivates, in 
order to bound the probability that 7 exists, the introduction of the Markov 
process Xt with state space Z/2 and with transition rates 



X t 



/ L-^J + 1) a t ra l e Gdfib 
\X t - 1/2, at rate <f> aa , 



where [ - J denotes the integer part. The choice of the transition rates of Xt 
and the observation above imply that the probability that the invasion path 
7 exists is bounded by the probability that, starting at 0, the process Xt 
reaches state K before state —1. To estimate the latter, we consider the 
discrete-time version of Xt, which is the Markov process Y n with transition 
probabilities 



(16) Y r 



n+l 



\Y n \ + 1, with probability r, 
1^ — 1/2, with probability /, 

where the probabilities r (right) and / (left) are given by 

6#&b , , 4>aa 
r = and / = . 

<Paa + 6d(f)bb 4>aa + 6d(f>bb 

Then, denoting by Tj the first time the process Y n hits state i, we have 
P(7 = (7(0),7(1), . . . ,7(i^)) is an invasion path) 

(17) 

<P{t k < T- X \Y Q = 0), 
which decreases exponentially in K according to the following lemma. 



SPATIALLY EXPLICIT NON-MENDELIAN DIPLOID MODEL 35 
Lemma 5.3. Assume (4). Then there exists a constant c>2d such that 
P{t k < r_i|y = 0) = (c - 1) • (c K+1 - I)" 1 < c~ K . 

Proof. Let Pi denote the law of the process starting at Yq = i. Then 
the transition probabilities of the process indicated in (16) imply that 

00 • I 2 r 

Pi{ri-i < r i+ i) = l 2 V(/ • r) J = - and Pi(ji-\ > r i+i ) = r 

1 — Ir 1 — It 

3=0 

for any i £ Z. Let pi = P%{tk < t_i) and = (l 2 /r) 1 . By decomposing 
according to whether the process first jumps to i — 1 or i + 1, and using the 
fact that 

(/ 2 + r)w(i) = Z 2 u;(i - 1) + rw(i + 1), 

we deduce that w(Y n ) is a martingale. The martingale stopping theorem 
then implies that 

w(i) =piw(K) + (1 -pi)w(-l) 

from which it follows that 

Pi = (w(i) - iw(-l)) • M^) - ^(-l))" 1 = (c' +1 - 1) • (c K+1 - ly 1 

with c = / 2 /r. This implies that Pq{tk < t_i) = po = (c — 1) • (c^ +1 — l) -1 . 
Finally, since 

I 2 It > 2d if and only if 4> 2 aa > I2d 2 4> bb (4> aa + 6d(f> bb ), 



if and only if <p aa > 6d ^1 + y 1 + 
which is condition (4) in Theorem 4; the lemma follows. □ 

5.3. Proof of Lemma 5.1. First of all, combining Lemma 5.3 with (17), 
we obtain 



P(l = (7(0)57(1)1 • • • >7(-^0) is an invasion path) < c 



-K 



with c > 2d given in Lemma 5.3. Now, let x € <9-Fl+i = -Fl+i \ Fl for some 
L > and denote the set of paths starting at x and ending in the neighbor- 
hood of by Q x . By the triangle inequality, for any 7 = (7(0), 7(1), . . . , j(K)) E 

KT— 1 

L< || 7 W -7(0)|| < £ || 7 (* + 1) -7(011 

i=0 
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that is, each path in Q x has length at least L. Furthermore, since each 
site has 2d nearest neighbors, the number of possible paths of length K is 
bounded by (2d) K . It follows that 

P (there exists 7 £ Q x such that 7 is an invasion path) 

00 

< P(there exists 7 £ 0^ such that 7 is an invasion path of length K) 

K=L 

00 

< £ (2d/c) K = ^— x (MA)*. 

K=L C ZCl 

Finally, since the birth of a gene of type b at site implies the existence of 
an invasion path starting at some site x £ Fn for the process starting with 
all sites in Fn occupied by individuals of genotype aa, by summing over all 
sites x £ Fn , we obtain 

Pf n (Tn < 00) 

< P (there exists 7 £ Q x such that 7 is an invasion path) 

x<£F N 
00 

< P (there exists 7 £ 0^ such that 7 is an invasion path) 

L=Nx£dF L+1 

oo 

< m + 3) d - (2L + l) d ) x x (2d/c) L 

L=N C m 

< C17 (2d/c) JV < Ci 7 exp(-ai 7 iV) 

for suitable C17 < 00 and 017 > 0. This completes the proof of Lemma 5.1. 

5.4. Proof of Lemma 5.2. Lemma 5.2 is a straightforward consequence 
of Lemma 5.1. Let e > and denote by T£ N the first time the cube F^n is 
void of genes of type 6, that is 

T3N = m f{^ > : rjt(x) = aa for all x € F$n}- 

By applying Lemma 5.1 to each of the sites belonging to 8F2N+1 at time 
T£ N , we obtain 

P(r)t(x) 7^ aa for some x £ at some time i > T^y | Tg^y < 00) 

< P(rjt(x) 7^ aa for some x € di^v+i at some time t > T£ N \ T£ N < 00) 

< ((2N + 3) d - (2N + l) d ) x Ci 7 exp(-ai 7 iV) 

< Ci 8 exp(-a 18 N) 
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for suitable C%s < oo and a is > 0. Pick N sufficiently large so that 

C\i exp(— a\jN) + C\% exp(— ais N) < e. 

Now that N is fixed, we will define a time T satisfying Lemma 5.2 as a 
function of N. First of all, we observe that if site is in state aa at some 
time t then there is a positive probability (uniform in t) that the cube F%n 
is void of genes of type b at time t + 1. This implies that 

P(T^ N <oo\T N = oo) = l. 

Since the sequence of events {T£ N < n}, n > 0, is nondecreasing, we can 
apply Beppo-Levi theorem to obtain the existence of a large enough deter- 
ministic time T < oo such that 

P(T£ N < T\T N = oo) = 1 - e. 

In conclusion, 
Pf n {j]t{x) 7^ aa for some x € F2n) 

< Pf n (Tn < oo) + P(t]t(x) aa for some x G i^JV! Tpj = oo) 

< P Fn (T n < oo) + ^(7^ = oo; T£ N > T) 

+ P(?]t{x) aa for some x G F2N',T£ N < T) 

< P Fn (T n < oo) + P(T N = oo; T* N > T) 

+ P{rjt{x) 7^ aa for some x 6 i^jy at some time t > T£ N ;T£ N < oo) 

< e + Cir exp(-ai 7 iV) + Cis exp(-ai 8 7V) < 2e. 
This completes the proof of Lemma 5.2 and Theorem 4. 

6. Proof of Theorem 6. Theorem 6 predicts convergence to the "all 
aa" configuration for a suitable choice of the parameters and for the 1- 
dimensional process starting from any initial configuration with infinitely 
many individuals of genotype aa. In Sections 6.1 and 6.2, we will prove for 
the process starting with only aa on the half line (— oo, 0] that lim^oo r t = oo 
where 

r t = sup{x G r L:r]t{z) = aa for all z < x}. 

This implies that (i) starting with a single individual of genotype aa at site 


lim Zr = — oo and lim Zf = +oo, 

t— >oo t—l-OO 

almost surely on the event {Z^ < < Zf for all t > 0} where 
Zf = inf{x < : f]t(z) = aa for all x < z < 0}, 
Z^~ = sup{x > : rjt (z) = aa for all x > z > 0}, 
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and that (ii) P(Zf < < Z+ for all t > 0) > C 19 > 0. From (i) and (ii), it 
is straightforward to deduce that, with probability 1, the process starting 
with infinitely many individuals of genotype aa converges to the "all aa" 
configuration. Coming back to our initial objective, we assume from now on 
that the process starts with only individuals of genotype aa in the interval 
(— oo, 0]. To show that lim^oo r% = oo, we first set Kt = st — rt — 1 where 

St = inf{x > r t + 1 : rj t {x) = bb} 

and observe that the process Kt evolves as indicated in Table 3. The proof 
is carried out in two steps according to whether cp aa > cpba or <f) aa < foa- 

6.1. 4> aa > tfiba- I n this case, the process r t has a positive drift when 
K t 7^ (see Table 3) and the proof consists in estimating the fraction of 
time Kt ^ when 

'raa > 4>bb + 4>ba + V4>bb4> ba ■ 

First of all, we observe that, since cj) aa > max(^>f, a , 4>bb)i the process K t jumps 
from I\ to at rate at most 2(j) aa , which implies that 

P(K t+h = 0) < P(K t = 0) x [1 - 2(<f> aa + <f> bb )h) 

+ P{K t ^0)x24> aa h + O{h 2 ). 

By first taking the limit as h — > and then the limit as t — > oo, it follows 
that 

- P(K t = 0) < 2<p aa - m aa + 2^bb)P{K t = 0) 

and then that 

limsupP^ = 0) < - ^ . 

t^oo 2(p aa + (pbb 



Table 3 

Evolution rules of Kt . In our table, Ik = {fc, k + 1, . . .} , o = state aa, • = state ab and 

• = state bb 



Configuration 


Transition 


Rate 




Drift (n) 


ooo» • ■ ■ 


0->Ji 


> 2(4>aa + <f>bb) 




> -~2<f>bb 


OOM • • • 


l->0 


4>aa + <f>bb 




> 4>aa — 4>ba 


oota • • • 


l-»Ja 


> (f>ab + 4>ba 




> 4>aa — 4>ba 


OOH • • • 


1 2 ->0 


4>ba/2 


> 


t>aa + 4>ab/2 — 4*ba 


oo«o • • • 




< 2<t>aa 




> 4:<fraa — 4>ba 


oo»» • ■ ■ 


h->l 


< <f>aa + 4>bb + fiab/Z + 4>ba/2 


> 


paa + <f>ab/2 - 4>ba 


oo«o • • • 


h -> 1 







> 40 aa — (f>ba 
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Since the drift is bounded from below by cp aa — 4> ba when K t ^ (see Table 
3), we obtain 

liminf ^ > -2^ limsupP^t = 0) + {<p aa - ^ ba ) liminf P(K t + 0) 

t — >oo t t too ^ — >cc> 

\ o, 4>aa , , , , N <Paa + <Pbb 



2</>aa + <Afe6 20 aa + (j) bb 

A, 

2<j) bb + ((j) aa - (j) ba )(l + (t>bb/<t>aa)} 



2<t>aa + <t>bb 



9aa ~ (<Pbb + 4>ba) 



24>aa + <f>bb 

This leads to the requirement that <^ Q — (4> bb + 4>ba)(l>aa — ^bb^ba > 0, which 
is equivalent to 



<\>aa > \{<\>bb + 4>ba) + \J ^bb^ba + \{<t>bb + <t>ba) 2 ■ 

Since, by subadditivity of the function square root, 

\J §bb4>ba + l(4>bb + <t>ba) 2 < V 4>bb4>ba + \ (4>bb + 4>ba) , 

this shows that condition (5) implies that \\m t ->ooi't = oo. 

6.2. 4> aa < 4> ba . The process r t may have a negative drift when K t = 1 
but has a positive drift when K% > 2 so we compare the fractions of time 
Kt = l and Kt>2. We start by proving the result when 

4>aa > max^^y^:, (p ba - and <j) bb = 0. 

In this case, the drift when Kt = is nonnegative (the worst case is drift = 
0) so we shall think of state as a "neutral" state, state 1 as a "bad" state 
and states in I2 as "good" states. We will construct a Markov process Xf 
with state space {0, 1,2} such that 

limsu Pt P(ifr = 1) < lirm P(X t = 1) 



liminf t P(K t > 2) - lim* P(X t = 2) 
Noticing that: 

1. K t jumps from 1 to I2 at rate at least cp ab + (p ba , 

2. K t jumps from I2 to {0, 1} at rate at most (j) aa + (p a b/^ + <t>ba, 

and having in mind that is neutral, 1 is a bad state and 2 is a good state, 
we impose that: 

1. Xt cannot jump from to 2 but jumps from to 1 at some positive rate, 
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2. Xf jumps from 1 to 2 at rate (f) a b + 4>ba, 

3. X t jumps from 2 to {0, 1} at rate cp aa + 4> a b/2 + <?W 

The process Xt describes a "worst case" scenario in which the time spent in 
bad states is rather large and the time spent in good states is rather small 
compared to Kt so that (18) holds (note that the time spent in state is 
unimportant in this inequality). Moreover, regardless of the specific values 
of the other transition rates of the process Xt, the conditions above imply 
that the stationary distribution ir = (ttq, tt\, 1T2) of Xt satisfies 

• 7T + {(f>ab + 4>ba) ' TTl - (<f>aa + 4>ab/2 + 4>ba) • VT 2 = 0. 

In particular, we can conclude that 

limsupjP^t = 1) limsupji^A'i = 1) 



liminf t P(K t > 2) ~ liminf t P(X t = 2) 

_ lim f P(X t = 1) _ 7Ti _ (j>aa + (f>ab/'2 + (f>ba 
limt P(X t = 2) 7T 2 4>ab + 4>ba 

which we rewrite as 

(4>ab + <t>ba) x limsupP(ET t = 1) 



(19) 



t— >oo 



< (<Paa + + <&a) X liminf P(tf t > 2). 



To conclude the proof, we now distinguish two cases. 

1. If 4> (l a < (l/6)0 a 6 then (19) implies that 

limsup P(K t = 1) < liminf P(K t > 2). 

Moreover, the drift when Kt > 2 is larger than 40 aa — and the rate 
at which the gap process jumps from is strictly positive, so there 
exists C20 > such that 

ft 

liminf — > C 2 o(4'aa - ha) limsup P(K t = 1) 

t-*oo t t-*oo 

+ C 20 m aa - ha) liminf > 2) 

1^00 

> C 20 (5ha ~ 2<p ba )limsup P(K t = 1) > 

since ha > (2/ 5) ha- 

2. If 4> aa > (l/6)(/> a fe, then, by using the fact that 4> aa < 4>ba together with 
(19), we obtain 

limsupP(^ = 1) < 2 x liminf P(K t > 2). 
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Moreover, the drift when K t > 2 is larger than (f> aa + (1/2)^ — fo a and 
the rate at which the gap process Kt jumps from is strictly positive, so 
there exists C 2 \ > such that 
ft 

liminf — > C 2 i(foa - foa) limsupP(i^t = 1) 

+ C 21 (fo a + (1/2)006 - M lirninf P(# t > 2) 

t— >oo 

> C 21 (3/2)(fo a + (1/6)^ - 06a)limsup P(K t = 1) > 

t— »oo 

since fo a > foa - (l/6)fo b . 

This establishes the result in the case when fob = 0. In addition, our proof 
implies that, properly rescaled in space and time, the process can be embed- 
ded in a supercritical oriented percolation process (we omit the details of 
the proof) which allows us to apply a perturbation argument similar to the 
one in the proof of Theorem 1 and extend the result to a set of parameters 
where fob is small but positive. This completes the proof of Theorem 6. 
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